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ABSTRACT 

A canonic system C is a specification of a recursively enumerable set, 
such as a set of strings over a finite alphabet. From this description 
C, it is possible to generate a system C m> called a proof measure func- 
tion, which is an indication of the complexity of the language defined. 
For certain simple but important classes of canonic systems, algebraic 
bounds on these functions can be derived from the structure of the sys- 
tem. Another transformation on C produces a system C which character- 
izes the recognition of strings generated by C. A relationship exists 

between the measure functions of C and c" , thus relating the complexity 
of the recognition procedure to that of the language description. 
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Chapter 1 - Review of Canonic Systems 

Introduction 

A canonic system is a recursive definition of the members of a set. 
Instead of enumerating its elements, we specify a set G by first giving 
a finite number of basic members, then rules of the form "If x is in G, 
then /(x) is also in G." Canonic systems provide the framework for 
expressing such a procedure in a formal way. 

For example, let G - {a | k is an even integer} be a set of strings 
over the alphabet {a}. Membership in G can be allowed thus: 

1. aa e G 

2. If x e G, then xaa e G 

3. No other strings are in G. 

A canonic system generating G could be written 

1. (- aaG 

2. xG \- xaaG 

where "\- aaG" is another form of the membership axiom "aa e G," and 
"<premise> }- <conclusion>" is a contraction of the "If . . .then. . ." 
statement. The fact that no other strings are to be allowed in G is 
implicit in the way the canonic system is to be interpreted. 

Definition 1 . A canonic system C is a 5-tuple 

C - (S, N, P, R, C) 



where 



S = an algebraic system (A, *,...,*) consisting of a set A 
and a collection of operations *,...,* on A. 

N = a finite set of variable symbols, usually {x, y,.,.,z}. 



P = a finite set of predicates, each of which is assigned a 
unique positive integer called its degree. 

R ■ a finite set of rules of inference, called canons, whose 
form will be described in Definition 2. 

C = a canonic system which defines the interpretation of the 
rules of inference in C; its structure and significance 
will be discussed in the next section. 

Previous formulations of canonic systems have tacitly assumed the 
algebraic system S to be the monoid (V , •) of all finite strings over 
an alphabet V under the operation of concatenation. By making the 
algebraic system explicit, we are able to specify canonic systems gen- 
erating sets of integers, real numbers, and so on, as well as sets of 
strings. 

Definition 2 . A canon is a rule of inference of the form 

f,(x ,.. .,x )F. A ... A f (x n ,... ,x )F U g(x, ,...,x )G 

*■ i. n ~~i ml n —m ' l n — 

where F , ..., F , and G are predicates in P; and f , , ..., f , and g 

1 m r 1 m 

are functions 
would be read; 



are functions on A formed f rom * , . . . ,* by composition. This canon 



If fj(x.,...,x ) has the property F. , and ..., and 

f (x.,...,x ) has the property F , then g(x-,...,x ) has 
m j. n * m in 

the property G. 

If Q is interpreted as the extension of a predicate, Q ■ {x | Q(x)} , 
then the meaning of a canon may be defined as 

G = {u | a z. , ... ,z e A [f, (z, .... ,z ) e F. A ... 

in ij. n l 

A f m (z lf T?Sy> « F m A g( 2l ,...,z n ) - u]}. 
The quantity /(x., . . . ,x.)Q is called a remark ; the remarks 



f.(x„,...,x )F. are the pr emises of the canon, and g(x, ,...,x )G is 
1 1 n — 1 r 1 n 

its conclusion . A remark /(x 1> . . . .x^) £ is said to be true iff there 
exist z.,...,z. e A such that /(z.,...,z.) e Q. 

* 
For example, consider this canon over the algebraic system (V , •)» 

where V is a finite alphabet and • is the concatenation operation: 
xB |- axaC 

This is interpreted as, "If x has the property B, then the string axa 
has the property C." Formally, 

C ■ {u| 3 z e V [z 6 B A u - aza]} 

= {axa | x e B} if B is a non-empty set of strings over V . 

Another canon might be 

xaD A yE |- <x , ya> F 
whose interpretation is 

F = {<u, v>| ax.yeV [xa€D A yel A <u, v> = <x, ya>]} , 

where the notation <u, v> denotes the ordered pair of u and v. Infor- 
mally, we are searching D for a string that ends in "a," then forming 
an ordered pair from part of it and any string from E. 

An axiom is a canon with no premises; it is therefore universally 
true. If a function g(x 1 ,...,x ) appears in an axiom, then x.,...,x 
must be allowed to range over all elements of A. Axioms might be |- 2N 

or f-x Si 

Definition 3 . A remark uG is immediately derived from a sequence 

of remarks dd,F, ,..., o)_F , u. € A, iff 
1— i nnn i 

1. f t (x. , . . . ,x JF. A ... A f (x.,...,x )F h g(x , ...,x )G 
11 n —1 ml n — m l n 

is a canon. 



2. a z.,.,.,2 e A such that f^z^...^) * u ±t and 



gCz^...^) ■ w. 



Definition 4 . A proof of a remark r is a finite list of remarks 

r, , . .. ,r such that 
1 P 

1. t is an axiom, 

2. r (1 < i £ p) is either an axiom or a remark immediately 
derivable from some subset of {r , . . . '» x j__j_i » 

3. r = r. 

P 

To illustrate these definitions, let us consider the system 
C - (({a, b)*, •), {x], {A}, R, C*), where the canons in R are 



h bA . 

xA |- axaA . 

This system contains the single axiom |-bA. so it is trivially provable 
that bA. A single application of the canon xA h axaA yields the con- 
clusion abaA. Continuing this process, we obtain the proof 

bA, abaA, aabaaA, ..., a ba A . 

Thus A = {a n ba n | n ^ 0} , a context-free language. 

We occasionally use the phrase "the language (or the set) generated 
by the system C" to mean the set G defined by some predicate G in C. 
Either the appropriate predicate will be obvious, or the selection of G 
will depend upon the situation in which the result is applied. 



Hierarchy of Canonic Systems 

A proof in a formal system usually consists of a finite number of 
repetitions of two operations: 

1. Substituting elements of A for variables, 



2. Inferring Y from X and X -. Y. 

These are operations which can be carried out by a mathematician with 
paper and pencil. In order to be able to formalize the idea of a 
proof, Moore and Donovan [ 3 ] allowed a canonic system C to describe 
the concept of a "proof" in a system C. A hierarchy is thus formed, 
since a proof in C is, in turn, described by a system C". For some k, 
c ( ) = c' , so that no new features will be found in higher level 
systems. 

For example, consider the system 

C = (({a, b}*, .)» [*}> {A}, R, C), where R consists of the 
canons 

1 . h bA 

2. xA (— axaA. 

X = {b, aba, aabaa, ...}. The second level system C will generate the 
proofs that each of these strings is defined by A. In other words, C 
will generate the set 

{bA, bA;abaA, bA;abaA;aabaaaA, ...} 

where we have used the semicolon only to avoid a collision of punctuation. 

This set can be considered a set of strings over the alphabet 
{b, A, ;}, and as such it may be generated by some canonic system. 
Define C to be that system. Its canons describe the effect on a proof 
of applying each of the rules of C: 

1. \-\ proof 

2. rv proof \- a ;bA jgroojE 

3. a proof a P;yA* proof f* aWaAs P roof 

Then C = (({;, b. A}*, .) , {a, P, Y, S }, {proof}, R, C") . It is not 
difficult to see how C operates. First, for convenience, it defines 



10 



the null string as a proof. Then, if some proof a has already been 
derived, a followed by the symbols "jbA" is still a proof because hbA 
is an axiom of C. Finally, if a proof a exists such that the pattern 
";<some string y> A" appears within it, a followed by the symbols 
"; a<some string Y>a A" is also a proof. This corresponds to applying 
the canon xA |- axaA in C. 

This formulation of higher-level systems is considerably simpler 
than their original description [ 4 ]. Since we shall have occasion 
to use such systems later, this transparency will be beneficial. 

Summary 

This chapter has been a review of the definitions and terminology 
associated with canonic systems. We have revised some of the formulations 
so that they are applicable over arbitrary algebraic systems; this will 
allow us to write canons describing, for example, sets of integers, as 
well as sets of strings. 
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Chapter 2 - Proof Measure Functions 
Introduction 

This section davelops a measure of complexity for sets of strings 
specified by a canonic systems. Such a measure is related to the time 
it would take a generator operating from the canonic system to produce 
an element of the language being described. 

Definition 1 . A proof of a theorem t in a system C is a finite 
list of remarks ^.....r such that 

1. r is an axiom 

2 - r i C 1 < i £ k) is either an axiom or a theorem 

immediately derivable from some subset of fr r h 

1 1 i-l J 
3. r, « t. 
k 

Recall that, in a given canonic system C, the meaning and derivation 
of a proof is given in the next higher level system C . Since C' is 
capable of generating all possible proofs of theorems in C, we make the 
following definition. 

Definition 2. The proof measure function m(C , t) of a theorem t 
in a canonic system C is the number of remarks in the shortest proof of 
t. That is, if r^ r 2 >---> r k ls the shortest proof of t in C, then 
m(C, t) = k. 

For example, in a system S containing the canons 

1. h bA 

2 . xA \- axaA 

we would have the proof bA, abaA, aabaaA. Therefore m(S, aabaa) - 3 
since there is precisely one proof for each string in this system. 
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Properties of m(C, t) 

We may now prove several properties of proof measure functions. 

Theorem 1 . If t is a theorem of C, then m(c , t) is computable. 

Proof. If t is a theorem of C, then there exists a proof of 
t in C: r. , r_,...,r, such that t = r, . This proof, considered as a 
string of symbols, is a string derived by the next higher level system C . 
Modify C' into C so that it keeps count of the length of the proof it 
is constructing for t. Change the axiom 

|~ \ proof 

into 

(- <K, 0> proof & length . 

In each canon of C which places a remark in the proof, replace the 
predicate proof with a predicate over ordered pairs, proof & length . 
Thus, if C has the canon 

a proof A • • • p- a;P proof , 

let C have the canon 
m 

<Cx, n> proof ft length a. . . \- <U$ » n+l> proof & length . 

Finally, let C have the canon 
m 

<o;P, n> proof & length f-<P» n> theorem & m(c, t) 

to associate with each t the length of its proof. Then the canonic 
system C computes m(C, t) . QED 

An Alternative Function 

Given that t is a theorem of C, the function m(C, t) gives us a 
measure of the difficulty of proving t in C. Unfortunately m is a 
function of each possible theorem. It would be desirable to sacrifice 
some of the exactness of m in favor of a more macroscopic quantity. 
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Definition 3 . Let m' (C, n) - sup {m(c, t) | t is provable in 
C and |t| = n}. 

The function m' looks at all proofs of theorems of length n in C 
and takes the number of steps in the longest proof as its value. Thus, 
given a theorem t of length n, we can, under certain conditions, say 
that it may be proven in no more than m' (c , n) steps. By definition, 
m 1 (C , n) is the upper bound on the length of proof of a theorem of 
length n. We now consider the conditions under which it is meaningful 
to speak of m' . 

Theorem 2 . m'(c, n) is computable if and only if C generates a 
recursive set. 

Proof . Consider the system C to be over a terminal alphabet T. 
Generate the set 

•k 

S = (w e T | to = n} , 

the set of all possible formulas of length n over T. For each w e $ n > 
we can determine whether or not it is a theorem in C since C generates 
a recursive set. That is, generate 

S ' - fu e S I a) is provable in C}. 
Then, since m*(C, w ) is computable when u is known to be provable in C, 
m'(C, n) « sup {m(C, u)|u e s ' n 3' 

If m' (C, n) is computable, then if any formula a is provable at 
all, it is provable in no more than m' (C, |a|) steps. Since C contains 
a finite number of axioms and canons, it is possible to generate all 
proofs of length 1 (the axioms), length 2 (one canon applied to each 
axiom), and so on. If the theorem a has not been proven after m'(C, |a| ) 
steps, it will never be. Since the membership question is thus decidable, 
C must generate a recursive set. QED 

There is a property of m' which is significant for its application 
to complexity measures for programming languages. 
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Theorem 3 . m'(C,n) Is not necessarily monotonic increasing in n. 
Proof . Consider the counterexample 

\- aaA 

xA f- xaaA 

h fl B 

xB |- xaaB 

xB h xA 

which generates the proofs 

1. aB, aA 

2. aaA 

3. aB, aaaB, aaaA 

4. aaA, aaaaA. 

Then m'(C,l) = 2, m' (C,2) - 1, m'(C,3) - 3, m'(C,4) - 2, and so on. 
QED 

This property is entirely expected, for some short theorems may 
be much more difficult to prove than long ones. 

A More Precise Function 

The functions m and m' evolve naturally from a consideration of 
the process of proof in a canonic system. They do provide an indication 
of the difficulty of deriving a given theorem, but they make the 
simplifying assumption that one canon is no different from another 
since each extends the proof one step. But certain rules may contain 
several premises, each of which must be examined before the conclusion 
may be drawn. Let us define another function which takes this into 
account. 

Definition 4 . The proof measure function n(C, t) of a theorem t 
in a canonic system C is the total number of premises evaluated in the 
shortest proof of t. That is, if * 1 »"«»* k is the shortest proof of 
t in C, and if the canons applied during this proof were 



15 



r n Ar n A...Ar m ^ h ^ 
r l2 Ar 22 A '-^\z h r 2 



r lk Ar 2k A -" Ar m k k ^ V v 

k 

then n(C,t) = zl m . 
j = l J 

Theorem A . n(C, t) is computable iff t Is provable in C. 

Proof. The proof Is almost identical to that of Theorem 1. Build 

the canonic system C so that, at each step in the proof of t, a count 

m 

is kept of the number of premises which have been evaluated so far. QED 

Definition 5 . Let n'(C, j0 - sup{n(C, t) | t is provable in 
C and |t|=* i). 

Theorem 5 . n'(C, I) is computable iff C generates a recursive set. 
Proof . Similar to that of Theorem 2. QED 

Theorem 6 . If p is the number of axioms appearing in the shortest 
proof of t in C, then nCC*"*) Si m(C, t) - p. 

Proof . If p axioms appear in the proof, then m(C, t) - p + x, where 
x steps are taken in ithe proof after the axioms are written down. The 
number of premise remarks evaluated for each of the axioms is zero; for 
the x remaining steps, y is x premises must be examined since each canon 
has at least one premise. Thus n(C, t) « + y. These three conditions 
imply n(C, t) * m(C, t) # p. QED 

Theorem 7 . If C is a canonic system possessing q axioms, then 
n'(C, jfc) :» m'(C, I) - q. for all A. 

Proof . At worst, some theorem of length t may require the use of 
all q axioms in its proof; in any case, it requires at least one. The 
conclusion follows from this observation and Theorem 6. QED 



16 



Bounds on Measure Functions 

Consider a canonic system C generating sets of strings. We would 
like to relate at least one of the proof measure functions to the 
structure of the system C in such a way that we could avoid having the 
function described by only another canonic system. Let 

D(k) » fx | x is provable in exactly k steps in C] . 

Then, by definition, D(l) is the set of all the axioms of C, D(2) is the 
set of all theorems provable by applying exactly one canon to the strings 
in D(l), and so on. Furthermore, let 

L(k) = sup {x| x e D(k)}, 
S(k) = inf {x| x e D(k)}. 

That is, L(k) is the length of the longest theorem provable in k steps, 
and S(k) is the length of the shortest. 

Assume that the canonic system C has the property that no canon 
ever "shortens" a theorem. That is, if t is provable in k steps and t* 
in k + 1, then |t*| fc |t| . Under these conditions, we can say that 
L(k + 1) & L(k) and S(k + 1) a S(k) for all k. 

Recall that L(k) is the length of the longest string possible after 
a derivation of k steps. Then the inverse function L (z) is the mini- 
mum number of proof steps required to generate any theorem of length z; 
all proofs of fewer steps yield strings shorter than z. Now the proof 
measure function m'(C, z) is, by definition, the maximum number of proof 
steps required to generate all theorems of length z. We therefore obtain 

Lemma 1 . L (z) 3 m'(C, z) if C is non-erasing. 

A similar argument may be advanced for the S(k) function. Since 
S~ 1 (z) is the maximum number of proof steps required to generate all 
theorems of length z, all longer proofs yield strings of length greater 
than z. This observation produces 

Lemma 2 . m'(C, z) s S* (z) if C is non-erasing. 
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and 



Theorem 8 . If C is a canonic system with the property that t 
provable in k steps and t* in k + 1 implies |t'| 2; |t|, then 
L _1 (z) <: m'(C, z) s: S -1 (z) for all z. 

Computation of L(k) and S(k) 

Consider a system C generating strings over an alphabet V in 
which each canon is either an axiom or a rule of the restricted form 

xF f- u) xu 2 G 

where w u 9 e V . Let I be the length of the longest axiom in C 
(X - sup {|w| I )- oj is an axiom of C]) and I be the maximum number of 
symbols added to a derived string by the application of a single canon 
(4 = sup {l^^al I *F h WjXWoS is a c& n° n of c }) • Then we ° bserve that 

L(l) - 4 o 

L(2) - L(l) + I 



L(k) = L<k-1) + l 



so that 



L(k) = i Q + (k-l)je. 

Similarly, if s q - inf { |u| ||-u is an axiom of C} and 
s * inf {|w,w 2 | I *£■ •" (j0 i xW 2- ls a canon of c } describe the shortest 
starting string and the least number of symbols added by any canon, then 

S(k) ■- s + (k-l)s. 
The inverses are easily found to be 



l" 


•'w 


as 


Z ■ 


■ V 


+ 


I 


s" 


z ■ 


A 

- 8 



+ 


s 
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Noting that canons of the form used never "shorten 11 a theorem, we may 
combine these results with Theorem 8 to yield 

Theorem 9 . If C is a canonic system in which all canons are either 
of the formf-w or xF (- w-xw-G, then 

z "A„+j4^ ,,„ * z-s +s 



o ' " £m'(C, z) g o , where j| , £, s , s are as 



I a 

defined previously 



The class of canonic systems yielding exactly the regular sets 

requires canons of the form either [- aG or xF f- xaG, where a is a single 

symbol from V. Thus i « £ = s =s-l, and Theorem 2 produces 

o o 

z sj m' (C, z) s z. We have proven what may be obvious: 



Corollary 1 . If C defines a regular set, then m' (C , z) = z. 

Now consider a system C generating strings over an alphabet V in 
which each canon is either an axiom or a rule of the restricted form 

X..F, A-.-AxF t- u x. wx ... to ,x, WG 
li n-n ■ o i, 1 i n-1 i n— 

where u^ e V , * L s {x^ .. ,,x }, and n > 1. 



Define 

1. r 

max 

is a canon of C] 



^ = sup £n| * lEl A x 2 F 2 A-.-Ax^h ^.-^G 



2 - r min = inf f n I X £l A X 2^2 A * * ' A *A >" g(X l' " ' ' >X n )G - 
is a canon of C} 

3. i Q = sup {|w| J \-u is an axiom of C} 

4. x- sup (lu)^ ... wj | x^ a- .-AX^ |- 

w x, w.,x, . . . u ,x. w G is a canon of C) 
o i. 1 i. n-1 in— J 

12 n 

5. s = inf {ju| |Hw is an axiom of C} 
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s = 



= inf Cl^l •"• "n 1 I *lh A "- AX A •" 



u- x. w x. . . . u) .x. w G is a canon of C and n = r . ] , 
o i. 1 L n-1 l n— min J 

I z n 



The definitions of £ , jj, s , and s are similar to those given 

in the case where each canon is allowed to have no more than one premise, 

In addition, we desire to know r and r . , the maximum and the 

max min 

minimum number of variables appearing in any canon. Again we desire 
to find L(k) and S(k), the lengths of the longest and the shortest 
string derivable after a proof of k steps. 



By definition, L(l) = X » the length of the longest axiom. Now 
the longest string derivable by the application of a single canon would 
be produced when the longest strings possible were assigned to 



'1* 



>x 



in the canon 



max 



: l£l A x 2 F 2 A... A x r ^ |- V^Vi, 

max max 1 2 



u) ,x. w G 
n-1 i r - 
r max 
max 



and |w cj 



o 1 



oj I = &. Such a canon may or may not exist in C, but 
max 



no canon in C produces strings growing at a faster rate. Thus we observe 
that 

LCD - i t 

L < 2 > - r max L(1 > + * 



so that 



L(k) = r L(k-l) + I 

max v ' 

lr 1 k " 2 4 

L(k) = & (r ) K_i + zJ~(r ) 1 
*o max' * *j—z ^ max' 

i, i /( r ) k_:l " 1' 
, N k-1 , s max 

' ^o (r max ) + *>[— — 

\ max 



Solving this equation for k to produce the inverse function, we obtain 



20 



/z(r - 1) + i\ > . 

-1, x , _, . / max ] where r > 1. 

L (z) - 1 + log I- . I max 

r max \ & o ' 

We now desire to find an expression for S(k), the length of the 
shortest string derivable after k applications of canons of the form 

*lll A •'• A X r#n I" V^Vij •*• Vl X i n U n £ ' 

In this canon, we must substitute strings for n variables. If the 
length of the string produced as a conclusion is to be as short as 
possible, each variable should be assigned the shortest possible string. 
However, some variable must refer to a string produced in the previous 
proof step, or there would be no reason to have that step. This would 
contradict our assumption that all proofs are of minimum length. We thus 
produce the shortest permissible result when one string x p is assigned the 
length S(k-l) and n-1 have the shortest possible length, s q . Once again, 
there may or may not be e canon which allows this, but all other sub- 
stitutions will produce longer conclusions. 

In addition, we desire to use the canon with the least number of 
variables in its conclusion. This canbn, by definition, contains constant 
strings of total length s. For sufficiently large string lengths, the 
effect of concatenating several long strings is greater than just adding 
a constant number of symbols each time. Thus we expect this analysis of 
S(k) to be valid for only large k. 

The expression we have justified for S(k) is 
S(k) = S(k-l) + 3 o (r fliin -1) + s . 
The solution of this equation is 

S(k) - (k-D(s o r min + s) - (k-2)s o , 
whose inverse is 

,-i, * z * s ° ( w z) + s • 

S (z) ■ ■■ 

s (r , - 1) + s 
o min 
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Combining these results, we obtain 

Theorem 10 . If C is a canonic system in which all canons are either 
of the form I- u or x^ A ... A xj^ h w^ u^x ... Vl^n £ ' the ° 

._/.. _i\ + i v z + s (r . -2) + s 
1 + lo g r Z(r max l) + \ t m , (C , z) * / min 

for sufficiently large ,, r^ > 1, and s q , s, JL q , and & as defined 
previously. 

The class of canonic systems yielding exactly the context-free 

languages requires canons of the form either h aG or xAAyBf- xyG, where 

a is a single symbol from an alphabet V [4] . Then s q » i Q - 1, a = ■ I - 0, 

and r = r =2, and Theorem 10 reduces to 
min max 

Corollary 2 . If C defines a context-free language in standard form, 

then 1 + log 2 z £ m'(C, z) ^ z. 

Summary 

We have defined four functions measuring the complexity of a 
derivation in a canonic system. These functions were shown to have 
certain properties, and relationships among them were derived. For 
canons in which only the identity function appears in the premises, we 
derived bounds on some of these functions in terms of the structure of 
the structure of the system. 

There is a question of interest which we have not settled: Given 
a system C, is it possible to find a system C* such that C* is equivalent 
to C, but m'(C*, I) < m'(C, I) for all A? In other words, is it possible 
to "speed up" a canonic system in the same way that a Turing machine can 
be speeded up? The answer to this question will determine the nature of 
a lower bound on our measure functions. 
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Chapter 3 - Inverse Systems and Compiling 

Introduction 

A canonic system C is a specification of the members of a set and, 
as such, gives rules for generating successive elements of that set. 
Because of its similarity to the parsing problem, the inverse process is 
more interesting: given a string u, how could it have been generated by 
C? The operation is one of analysis rather than synthesis. 

An algorithm which accepts a canonic-system description of a language 
and analyzes an input string based on this description exists [ 1] . We go 
one step further and create from C a new canonic system C* which exactly 
describes the recognition procedure for strings generated by C. 



For example, consider the language JT « ra n ba n | n * Oi defined by the 
system 

h bA 
xA\-axaA . 

A proof in this system C that aabaaA is 

bA, abaA, aabaaA . 

Consider the system 

axaA W'&A 
bAhl 

where the conclusion I indicates an arbitrary terminating condition. When 
this system is presented with an axiom |- u such that w is provable in C, it 
proceeds to decompose that string. For example, if we include the axiom 
(- aabaaA, the system becomes 

|- aabaaA 
axaA f- xA 
bA h I 

and generates the proof 

aabaaA, abaA, bA, I . 
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Apart from the end-of-proof indicator I, this is simply the proof in C 
written backwards. The system we have just presented is C , the inverse 
of C. 

Construction of c" From C 

The relationship between the proofs in (f and C suggests a procedure 
for obtaining the canons of C" from those of C: interchange the premise 
and the conclusion. Thus if C has the canon 

xA }- axaA , 

C should have 

axaA ^- xA . 

This procedure fails if the original canon had multiple premises, for the 
inversion of the general canon 

f.(x, I ... l xjFi A. ..A f (x.,...,x )F [- g(x. ,...,x )G 

it n i mi n tii i n 

would produce 

g(x, , . . . ,x )G |- f . (x. , . . . ,x )F n A. • .A f (x, OIL > 

1 n - "' ll n —1 mi n ^m 

and we do not define the meaning of a canon with multiple conclusions. 
(There is no way of indicating in a proof that several conclusions must 
hold simultaneously.) Thus only systems C in which each canon has no 
more than one premise possess an inverse c" by this construction. It is 
the case, however, that any arbitrary system can be reduced to this form, 
and this we now demonstrate. 

Normal-Form Systems 

Given an arbitrary system C, it is possible to derive a system C.. such 
that 

1. Each canon in C-, has a single premise. 

2. A remark t is provable in C iff t is provable in C. 

The general canon we must consider is 
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f-iCx. OF. A... A f (x. ,...,x )F^ |- g(x- ,...,x )G 

l l n ~ •! ml n ~~ m l n — 

where 

d. 

£ A 11 A I 

f .: A -» A 

since the canon is over the algebraic system (A, *-,...,*.). We shall 
call d. the degree of f. ; the result of applying f . is a d, -tuple of 
elements from A, and F. must be a predicate over d, -tuples. For con- 
venience, let us abbreviate this canon as simply 

f-F a... A s-'JLh S£ . 
l i 111 ui ^^ 

The following algorithm produces (L. from C: 

1. If C has either the axiom |- uG or a canon fF (- gG with a 
single premise, then retain these rules in C... 

2. If C has the canon 

f-F. A... A fF h gG , 
11 ui ui 

then let CL. have the canon 

<£-,..., f » F, .. .F j- gG 

» 

and the new predicate F- . . .F of degree (2— d.). 

1 m £-1 i 

3. If CL_ has the canons 

<r. , . . . , r > R \~ uR 1 

1 P ~ — IZ ; 

<s 1 ,...,s > S |- vSJ_ , 

^ then let CL. also have the canon 
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<r, ,...,r .s, , ...,s > RS J- <u, v> R'S f 

l P 1 q —— — — 

and the new predicates BS of degree (degree R + degree S) and 
R'S 1 of degree (degree R* + degree S'). Repeat step 3 for 
each pair R'S* describing a predicate created by rules 2 and 
3. Since C has only a finite number of canons, this process 
will terminate. 
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4. No other canons are in (L,. 

This algorithm reduces several premises to a function of higher 

degree. Rule 1 retains all those canons which already have only a single 

premise or none at all. Rule 2 reduces a canon with several premises to 

the desired form by combining the d results of each function f^^ <1 ^ i * m) 

m 
into one ( ]£I d.)-tuple. When an n-tilple is evaluated, all components must 

i»l 
be shown true, so the effect of requiring the proof of all the original 

premise remarks is achieved. When canons are combined in this way, new 

predicates are created in the premises, and Rule 3 provides the canons to 

describe these. This rule provides for the interaction between objects 

described by several canons. 

For example, let C be the system 

haA 
1-bB 
h cC 
xA jt- axA 

xB h bxB 

XC \-> CXC 

xA A yB h xyD 

*£ A yD h yxyE 

which generates 'E = CaV^aV I n,m,k .* l) . By Rule 1 of the algorithm, 
CL, contains the canons 

|-cC 

xA J- axA 
xB |-t»cB 
xC \-cxC . 

By Rule 2, G„ contains 

<x , y> AB f- xyD 



26 



corresponding to the two canons with multiple premises. Rule 3 provides 
definitions for the sets AB and CD; applying it to the canons in C r 
which define A and B separately produces the set of rules 

<x, y>AB \- ajax, by>AB 

xA \- <ax, b>AB 

yB [- <a, by>AB 

f- <a, b>AB 

The definition of CD generates the canons 

<x, y, z >ABC |- <cz, xy>CD 
oc, y>AB h <c, xy>CD 

A new predicate, ABC , has been generated, and it must be defined by 
applying Rule 3 again: 

Oc, y, z >ABC |- <ax, by, cz> ABC 

<y, z>BC |- <a, by, cz>ABC 

<*> y>Ag f <ax, by, c>ABC 

oc, z> A C h <ax, b, cz >ABC 

xA |- <ax, b, c >ABC 

yB f- <a, by, c >ABC 

zC |- <a, b, c> ABC 

The predicates BC and AC must be defined, again by Rule 3: 

<X, y5*C |- ocb, yc>BC 
xB \- <xb, c>BC 
yC \- <*, yc>BC 
<b, C5*C 

<x, ysAC H oca, yc>AC 

xA \~ <xa, c>AC 

. yC |- <b, yc>AC 

|- <*, «AC 

No new predicates were created in this step, so the generation of C is 
complete. 
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Consider the proof of aabbbcaabbbE In the original system C: 
bB, aA, bbB, aaA, bbbB, cC, aabbbD, aabbbcaabbbE . 
The system (L, simulates this derivation as: 

bB, <a, bb> AB , <aa, bbb> AB, <c, aabbb> CD, aabbbcaabbbE . 
We now show that (L. generates the same sets as C. 

Theorem 1 . A remark t is provable in C^ iff t is provable in C. 
Furthermore, the proof in (^ is effectively obtainable from that in C, 
and conversely. 

Proof . To show that t provable in C implies t provable in C^, we 
will use induction on the length of the proof in C. 

Basis: If t is provable in one step in C, then t, being an axiom, 
is also provable in one step in C^. Similarly, if t is provable in two 
steps in C, it is the result of applying a single -premise canon to an 
axiom in C. By Rule 1 forming C N from C, these operations may also be 
performed in CL.. 

Induction: Assume all theorems provable in less than or equal to 
k steps in C are also provable laC,,. A proof of length k+1 of a theorem 
t in C is a list of remarks r 1 »--«» r i c+ .i 8Ucn that 

1. r is an axiom. 

2. r, (1 < i s k+1) is either an axiom or a remark 
immediately derivable from some subset of {^ r i-i?* 

3 ' r k+l = '• 

t is not an axiom of C since it is assumed to be provable in no less 
than k+l steps. Therefore r fc+1 is immediately derived from some subset 
{(•>.£,.,...,<•> Pj of {r^...,^} by the application of a canon 

f !<*! * n )F-l A ... A f m (x r . . • ,x n )F m h gC*! * n >£ 

where there exist z^...^ e A such that f 1 (» 1> • • • » z n ) " u i and 

efz z )G - r, ... Furthermore, each w.F. is provable in less than 

ox 1 n — k+1 i-"" 1 

or equal to k steps. 
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From these facts about the proof in C, it is possible to con- 
struct the proof in C... When C derives r.... by applying the canon 

f t (x.. x )F. A ... A f (x- .... ,x )F \- g(x. .... ,x )G , 

11 n — i ml n m l n 

C„ uses 

N 

<f .,..., f >¥. ...F h gCx^-.-.x^G 
l m —l m i n — 

generated by Rule 2 forming (L. from C. Each component of the m-tuple 

is provable in less than or equal to k steps in C; by the inductive 

assumption, each is therefore also provable in CL^, as is the entire 

m-tuple. Then, as long as the predicate F, . . .F appears in the con- 

i m 

elusion of some canon, r^.. may also be proven in C^. But Rule 3 of 
the construction algorithm insures that all predicates appearing in 
premises appear on the right-hand side of some canon, so r, , is 
indeed provable in CL. 

Thus the existence of a proof of arbitrary length of t in C 
implies the existence of a proof of t in (L 

It is now necessary to show that t provable in CL. implies t 
provable in C. The proof of t in CL. is a list of remarks s.,...,s 
such that 

1. s- is an axiom. 

2. s. (1 < i s t) is immediately derivable from s.^. 

3. s„ = t. 

The following procedure reconstructs the proof of t in C. Con- 
sider, with no loss of generality, each remark s. to be of the form 



<*! %> &i":% ^ ere P g L 



In the construction of C from C, multiple premises in canons are 
reduced to components of one n- tuple. If we know the predicates in C, 
we can look at such a remark in the proof in C N and decompose the one 
term into several. That is, the remark 
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<V" q p >fi l- : -^ S P 

in C„ becomes the set of remarks 

N 

in C, where each Q. may be a predicate of arbitrary degree. Continuing 
this procedure on each remark in the proof s .,..., s of t in C produces 
the proof of t in C. Therefore t provable in C implies t provable in 
C. QED 

Measure Functions of Normal Systems 



Theorem 2 . n(C , t) - m(C N , t) for all t provable in C N> and 
r £ - m* (C N , i) for all jj. 

Proof . These are trivially true since each step in a derivation 



in C N requires the evaluation of exactly one premise. QED 

Theorem 3 . If v is the maximum number of premises per canon in C, 

m(C N , t) 

then m(C , t) <; m(C, t) <; ^ for all t provable in C. 

v - 1 

Proof . Figures 1 and 2 illustrate the correspondence between 
derivations in (Land C. We observe that each node in the C tree 
expands into at least one and at most v nodes in the C tree. Both 
trees have the same number of levels (the number of nodes in the C 
tree is equal to the length of the longest path in the C tree) , so 

that if the top level in C expands into v nodes, the next lower level 

2 
could contain as many as v.\i = v , and so on. Thus 

m(C s t) fe m(C N , t) 

mi_C , t)-l 
m(C, t) s 2_ v 
i-0 

so that 

jn(C N , t) 
m(C , t) s m(C, t) s: ' 

N v - 1 

QED 
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L k+1 




Fieurt 1 - Derivation in System C 



<u, 



<s i'V 



k+1 



»u > F. ...F 
m — 1 m 



' S o jtfl" *** '^ 



Figure 2 - Derivation in System C N 
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Inverses of Arbitrary Systems 

Recall that normal-form systems were introduced to show that any 
canonic system C can be reduced to another system C„ in which each canon 
has only one premise. The difficulty in constructing the inverse system 
C from an arbitrary C arose exactly in the consideration of canons with 
multiple premises. The normal- form theorem resolves this problem by 
always providing an equivalent system of the required form. We may now 
formally define an inverse system. 

C is constructed from a system C in normal form by the following 
algorithm: 

1. If C has the axiom h^. then let c" have the canon 

cjG h I, where I is an arbitrary end-of-proof indication. 
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If C has the canon f(x.,...,x )F \- 8<x. , . ..,x )G, then let 
, I n -" i n 

C" have the canon g(x- x )G (- f (x. , . . . >*_).£• 

Let C u {a} denote the system C with the added canon a. The inverse 
system has the following property. 

Theorem 4 . t is provable in C iff I is provable in C U { (- t) • 
Furthermore, the proof in C - is effectively obtainable from that in C, 
and conversely. 

Proof . If t is provable in C, then there exists a proof r-,...,^ 
of t In CL such that 

1. r. is an axiom. 

2. r (1 < i £ k) is immediately derivable from **-,' 

3. r. - t. 

k 

If the axiom |-t is given to c" , the only proof possible must begin t,... 
since C - has no other axioms. In the proof of t is C M , t « r, is 



J N' k 



derived from r, 1 by applying a canon 



f(x 1 ,...,x n )F |- g(x 1 ,...,x n )G 

where there exist z.,...,z e A such that £(z.,...,z )F « r . and 

J. n i. n — k- j. 



■^ 
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g(z.,...,z )G " r, . By the construction of C - from CL., the inverse canon 

g'(x 1 ,...,x n )G h Hx l ,...,x n )F 

is available in C , so the proof in C* can continue: t, r, .,... . This 
process may be repeated until the axiom r. is reached: t, r, _., . . . , r.. . 
Then, since the canon r. (- I exists in the inverse system if |- r, was an 
axiom in C„, the proof terminates t,r,_. , . .. ,r. ,1 , proving I. 

If I is provable in C - U { h t} , then there exists a proof 

t,s.. s, -,I in that system. The last canon applied must have been 

s, . h I, since these are the only canons which generate I. By the con- 

k " L - 1 

struction of C from C , |- s, 1 must have been an axiom of CL. At each 

step in the proof where s, +1 is deduced from s. by the application of 

a canon 

g(x- .... ,x )G |- f (x. , . . . ,x )F , 

i n ~* i n 

the inverse rule 

f(x 1> ...,x n )F \- g(x 1 ,...,x n )G 

exists in Cv, and may be applied. After k-1 steps, we arrive at a proof 
s k-l'"*' s l in Si* A final application of the inverse of the canon which 
produced t from s.. produces the desired proof. QED 

Measure Functions of Inverse Systems 

The proof of Theorem A can be extended to indicate the relationship 
between the length of the proof in C - and that in C N . 

Theorem 5 . m(C" U { (- t] , I) = mCCL, t) + 1 for all t provable in C^. 

Proof. The result follows immediately from a consideration of the 
construction of Theorem 4. The proof in the inverse system is simply the 
reverse, remark by remark, of the proof in CL, plus the end-of-proof signal 
I. QED 

Corollary 1 . m'CC -1 U { J- t} , |l| ) - m'C^, |t |) + 1 for all t provable 
in C^. 
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Proof . Immediate. 

An interpretation of these results is beneficial at this point. The 
corollary states that if t is known to be provable in C^ in no more than 
k = m'CCj,, |t|) steps, then the inverse system C - , given the single axiom 
|- t, can reproduce the proof in no more than k+1 steps. 

Theorem 3 of this chapter states that m'<(^, I) * m'(C, Z) for all I. 
This fact and Corollary 1 produce 

Corollary 2 . m'Of 1 Ufht], |I|) * m»<C, |t|) + 1 for all t provable 
in C. 

Proof . Immediate. 

Inverse Systems As Recognizers 

Suppose the canonic system C describes a programming language. The 
system C' 1 behaves like an analyzer for that language since, given any 
string derivable in C, C _1 can produce a parse of that string. The set 
of all strings derivable in C is simply the set of all legal programs in 
the language described by C. Once the m» function is known for C, it is 
possible to characterize the recognition process of a -legal program in 
that language by applying Corollary 2. 

While m'(C, I) + 1 is an upper bound on the number of steps in the 
parse in c'\ nothing we have said helps construct this parse. C" , like 
any canonic system, is a non-deterministic statement of transformations 
which may. be applied at any step in a proof. In general, more than one 
canon may be applicable, and the correct rule to choose may not be im- 
mediately obvious. 

Recursive Sets And Programmin g Languages 

* Canonic systems in their most general form define recursively 
enumerable sets. Given a particular string u and system C, it is not 
possible^tfiuiietermine in general whether u could eventually be generated 
by C. A recognizer based on a recursively enumerable language may never 
halt for some input strings, and this situation is undesirable for 
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practical computing systems. 

One solution is to allow only language definitions which define 
recursive sets. It is undecidable, however, whether an arbitrary canonic 
system has that property (Appendix I). Another approach is to restrict 
the form of the system so that its language is known to be recursive. 
The correspondence with Chomsky's grammars does this, but the forms pro- 
duced do not resemble the definitions of programming languages. 

Let us examine the question from a different point of view: we are 
trying to insure that a compiler presented with an invalid program will 
not loop endlessly. In theory, a machine capable of examining any 
arbitrary string and deciding acceptance or rejection must be based on 
a recursive language. In practice, compilers are not usually asked to 
process completely arbitrary inputs. A source program may be correct 
except for the duplication of a statement label or a missing declaration. 

It is possible then to write the language description in such a way 
that common errors are detected. In effect, we are defining a new language, 
similar to the desired one except for well-defined errors. We still cannot 
guarantee our compiler will halt for arbitrary inputs, but "almost cor- 
rect" programs will not cause infinite looping. 

For example, consider generating DO loops in Fortran of the simplified 
form 

DO <label> <integer variable> = <constant>, <constant> 



<label> CONTINUE 

Most Fortran systems impose the restriction that the value of the DO 
index may not be changed within the loop; that is, 

DO 5 J - 1, 10 
J = 20 
5 CONTINUE 

is illegal. Assume we have defined in a canonic system the predicate 
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not in such that <x, y> not in is true iff the element x is not contained 
in the set y, and <x, y> in if it is. A specification for a legal DO 
loop is: 

<x, y> set of Fortran statements with assigned variables 
A z integer variable A <z, y> not in A w statement label 
A i integer constant A j integer constant 

DO w z - i, j 
[— x legal DO loop . 

w CONTINUE 

To detect the case of a program redefining the value of the index, we 
might write: 

<x, y> set of Fortran statements with assign ed variables 
A z integer variable A <z, y> in A w statement label 
A i integer constant A j integer constant 

DO w z = i, j 
|_ x program attempting to redefi ne DO index . 

w CONTINUE 

Thus a program with a syntactic error is recognized as such, and 
we may be assured that the compiler will not loop on receiving an incor- 
rect DO loop. The language defined by the entire canonic system may not 
be recursive in the formal sense, yet we may be able to determine member- 
ship in some set for most input strings we are likely to encounter. The 
language description might be called "almost recursive," and a recognizer 
based on that description would halt for "almost all" inputs. 

Summary 

This chapter has presented two main results. First, we showed how 
an arbitrary canonic system C could be reduced to an equivalent system C^ 
which contained only single-premise canons. From this normal form, we 
developed the idea of an inverse system C" which characterized the 
recognition of strings generated by C. In addition, we computed the 
measure functions of both C^ and C _1 and showed the relationship between 
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them. Finally, we developed a technique for developing 'almost recursive" 
language descriptions so that recognizers would not be l:\kely to loop on 
most inputs. 

Inverse systems as we have defined them do not specify parsing 
algorithmns for the languages they recognize. An open question is whether 
inverse systems can be so extended, and under what conditions. 



37 



Appendix I - Systems Describing Recursive Sets 

Introduction 

A canonic system can specify the elements of any recursively 
enumerable set. However, from at least a theoretical point of view, it 
is desirable to know under which conditions the set generated is also 
recursive. Doyle [4] demonstrated equivalences between canonic systems 
generating sets of strings and the formal languages of Chomsky [ 2 ]. 
Since type 1, 2, and 3 languages are recursive sets, the related 
canonic systems also generate recursive sets. 

In this section we consider canonic systems generating set of 
natural numbers. It is difficult to describe functions of strings, 
but any operation on natural numbers can be characterized by some 
partial recursive function. First, recall three definitions. 

A set Q is recursively enumerable if and only if it is the range 
of some total recursive function. Equivalently, there exists some 
Turing machine which will generate all members of Q in some order. 

The characteristic function C (x) of a set Q is defined to be 



C Q (x) = 



l if x« 



• 



if x^Q. 



A set Q is recursive if and only if its characteristic function is 
total recursive. In other words, there exists a Turing machine which, 
given any x, will decide either x e Q or x^Q. A set is recursive if and 
only if both it and its complement are recursively enumerable. 

Canonic Systems and Recursive Sets 

Let N be the set of natural numbers, and consider canons over the 
algebraic system (N, +) . We now prove two theorems relating canonic 
systems and recursive sets of natural numbers. 
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Theorem 1 . If a set Q s N is recursive, then it can be generated 
by a canonic system in which the only canon describing Q is of the form 

/(x)P h x£ 
where /(x) is a total recursive function. 

Proof . Since Q is recursive, its characteristic function 
r 



c Q <*> - \ 



1 if x e Q 



s if x^Q 

is total recursive. Let /(x) - C Q (x), and include the axiom KIP. Then 
the conclusion x£ in the canon 

C Q (x)P h *9. 

is provable if and only if x e N and C (x) = 1, since only then is the 
second premise C Q (x)P satisfied. Therefore xQ. is provable if and only 
if x e Q, as desired. The entire canpnic system generating A is then 

V IP 
c Q (x)P .h x 2 

QED 

The converse of this theorem is true, but a stronger result is 
possible. 

Theorem 2 . Let the set £ be defined only by canons of the form 
f !<*!.,- ...* n >£l A -'- Af m (x l'---' X n ) E m V 8<V •■• •*„>£■ 

Then Gf is recursive if 

1. g exists and is a total function 

2. f .,..., f are total 

1 m 

3 f $ are recursive sets 

J. * 1 ,...,r m 

4. g(x ,...,x ) depends on each variable x £ in a non-trivial 
way. 
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Proof . Assume, with no loss of generality, that the range of g 
is the set of k-tuples z = ^ . . . ,z^>. To show that ff is recursive, 
we show how to determine whether z& or z0 for an arbitrary z. 

If g" 1 exists and is total, for an arbitrary k-tuple z = <z 1 , . . . ,z k > 

we can compute g" 1 ^ ^ = o^,...^. Every variable that appears 

in the premises is some x ■ in g -1 (z) = <* v . . . ,x n > since g is assumed to 

depend upon all the variables non-trivially. Then, for each function 

£, ;...,f , we can compute f . (x n ,... ,x ) since each f is total. Then, 

1 m i i n i. 

knowing the values of f (x.^ . . . ,x n ) , we can determine membership in 

?,,..., F* since these are recursive sets. 
1 m 

Formally, the characteristic function of £ is computable for all 



z as 



C^(z) = C r (f^l-.-.x^). ... -C r (f m ( Xl ,.. ,x n )) 

where g il (z) = <* v . . . ,x > and C ? ,...,Cg. are the characteristic 

1 m 

functions of the recursive sets F^ , . . . ,f . Since C~(z) is total 

i m *» . 

recursive, Sf is a recursive set. QED 

i 
The General Decision Problem 

Theorems 1 and 2 relate certain canonic systems and recursive 
sets. This is not to say that recursive sets can onlv. be described by 
systems of those forms, nor that any system with canons of another form 
can not generate a recursive set. We wish to sketch two results of 
Mandl [ 5 ] which explain the apparent inadequacy of our structure 
theorems. 

Theorem 3 . It is undecidable whether an arbitrary canonic system 
generates a recursive set. 

Proof . A canonic system defines, in general, a recursively 
enumerable set, and it is undecidable whether an arbitrary recursively 
enumerable set is also recursive. QED 



40 



Theorem 4 . There is no class of canonic systems such that 

1. Every system in the class generates a recursive set. 

2. Every recursive set is generated by some member of the 
class. 

Proof . The proof requires a diagonalization argument which we 
do not wish to reproduce here. 

Summa ry 

For canonic systems generating sets of natural numbers, we have 
derived restricted forms of canons which guarantee that the system will 
generate a recursive set. The general question of deciding whether an 
arbitrary system generates a recursive set has already been shown un- 
decidable. 
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Appendix II - Equi valent Systems 



Introduction 



In previous papers on canonic systems, only canons describing functions 
of strings have been considered. That is, the algebraic system has been 
assumed to be the monoid (V*,0 of all finite strings over an alphabet V 
under the operation of concatenation. Under these conditions, the 
functions involved in the definition of a canon take on a particularly 
simple form. 

We desire the most general function f^,..-.^) from n-tuples of 
strings into k-tuplee. Since concatenation of variables and constants is 
the only operation available, that function is 



Id , > 



^^.-'V^OlVll-VV' "" ' Ok Z lk W lk-Vn k k' 

where z . e {x. , . . . ,x } and u> « V*. For example, a function over 

ij L 1 n J r J , 

fa, b, c]* from ordered pairs into ordered pai.rs might be 

f(x, y) - <abcxy, axbycx>. 

Other authors have defined restricted forms of this general function, 
such as one in which each variable x^...^ is used exactly once. We 
desire to investigate some of these forms and show the relationship 
between them. The effect of this is to allow us to rewrite a system in 
an equivalent form which may be easier to manipulate in a particular 
instance . 

Predicates of Degree k to Indicated Context 

Definition 1 . A canon with Indicated context is a canon in which 
at least one function f.^,..,^) in the premises contains a concatenation 
of a constant string and a variable. That is, 

fi^r-^'ViAi-VV' '" ,u, ok z ik u) ik--- z n k k\k> 

where z.. e {^,...,^1, w y8 V*, and at least one w.. is not the empty 
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s tr ing . 

This designation arises because the canon may be applied only if 
the context is matched. For example, the canon xaA A y B |- <x, y>C 
requires that a string ending in "a" be in K. 

Theorem 1 . Any predicate of degree k > 1 can be reduced to a 
predicate of degree L and indicated context. 

Proof . A remark involving a predicate of degree k > 1 is of the 

* n * 
form <s lS s , ...,s >P, where the range of each s . : (V ) -» V is a single 

string. Choose a symbol X not in V, and create a predicate Q of degree 

1 which contains the information in P in the form s^Xs^,. . .Xs fc 2- Wherever 

the quantity <s ,s , ...,s >P appears, replace it by SjXSj,. . .Xs k C;, which 

is of degree 1 and uses indicated context. QED 

Corollary 1 . Any canonic system can be reduced to one in which no 
predicate is of degree greater than one, if indicated context is allowed. 

Proof . Apply the theorem to all predicates of degree greater than 
one. 

The converse of Theorem 1 is true, but the resulting predicate of 
degree k may still require indicated context. (A trivial proof just 
changes every predicate of degree one into degree two and makes the 
information in each coordinate indentical.) The question to ask is whether 
the indicated context can be replaced by a simpler operation. This does 
not appear to be possible, as an example will show. Consider the canon 
xayA |- xayB, which has the effect of placing in B all those strings in 
A which contain an "a" anywhere. If we attempt to remove the indicated 
context and write something like <x, a, y>A, we have lost the ability to 
search for the "a" within the string xay. As a matter of fact, it 
becomes necessary to allow terms of the form xy, where the division of 
a string into its x- and its y-component becomes completely arbitrary. 
For example, to simulate a premise remark of xayA a ■••> we must write 

xA (- <x, \, \>B 

<wx, y, z>B |_ <JJ, xy, z>g. 
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<jo, xy, z>B f- <ji), x, yz>B 
<*, a, y>B a ••• 

where B is a temporary predicate introduced only to allow the searching 
from left to right. Since we feel such a canonic system is no simpler 
than one allowing indicated context, we shall not attempt to state a 
converse to the theorem. 

Relation to Post Systems 

At this point, we wish to demonstrate the equivalence between 
canonic systems and Post systems. Although canonic systems 
evolved from Post systems, an exposition of the process has never been 
made. 

A Post system is a set of rewriting rules of the form 

a i X l a 2V ' ' Wn+1 - Pl y l P 2 y 2' • "W^l 

where a. and p. are constant strings over a finite alphabet V, and 
y.g {x-,.'..,x }. For example, the rule 

xlly - x22yy 

would change the string 333114 to 3332244, if x . - 333 and y = 4. The 
left-hand part of a rule is called the antecedent , and the right the 
consequent . 

The distinguishing feature between Post and canonic systems is that 
a Post system allows only one workspace. All its rules refer to the 
same string, while a canonic system allows many such workspaces - one 
corresponding to each predicate. For example, the Post system geperating 
all strings of a's of even length is: 

axiom' : aa 
rule : x -» aax 

The analagous canonic system becomes 
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\- aaA 
xA |- aaxA 

No more computing power can be added to a Post system. It is simply 
often more convenient to specify a process using the notation of canonic 
systems. 

Theorem 2 . Any Post system oan be simulated by a canonic system 
using predicates of degree one. 

Proof . If the Post system has the axiom w, let the canonic system 
have the rule h <*W. If the Post system has the production 

a i X i a 2 X 2' • -Wn+l - P l y l P 2 y 2- ' 'WmH » 
let the canonic system have the rule 

a i X l a 2 x 2 • • 'WW* *" *1 W2 " ' ' W»H W -' 

Then the set*?? describes exactly the strings the Post system generates. 
QED 

Observe that, in general, canons with indicated context will be 
required. The equivalence in the opposite direction is of interest 
only to show the construction required. 

Theorem 3 . Any canonic system can be simulated by a Post system. 

Proof . We will only indicate the procedure involved in the 
simulation, since the details are tedious. We let the Post system 
build up a proof in much the same way that higher level systems do 
(Chapter 1). Markers must be left in the workspace to separate each 
remark in the proof, and non-determinism must be allowed so that any 
canon can "reach" arbitrarily far back for remarks proven previously. 

Reduction of Indicated Context to Cross Referencing 
Definition 2 . A canon 

^(^ ^Il A *•• A y x l'** , ' X n>£ m *~ 8(x 1 ....fX n >S 
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and z., e {x ,,..., x }, w ± , e V* is said to contain cross referencing if 
not all the z. are distinct variables; that is, z - z q does not imply 
that p = q for all p and q. 

Theorem 3 . Any canon using indicated context may be reduced to a 
rule without indicated context if cross referencing is allowed. 

Proof . Consider a premise term of the general form 

u 0W2-Vl Z r U r- 

where the oo. are constant strings indicating contest, and the z ± are 
variables. Form the r+1 new canons 



where the A. are new, distinct predicates. The intent of the original 
premise term can be simulated by 

X (A> A X l-1 A '•• A *A A Vl X lV'- X r-l z r x r ^ ' 

Since each A. contains exactly one element by construction, the effect 

of evaluating these premises is identical to that of applying the original. 

2ED 

Summary 

We have shown that successive reductions from predicates of degree 
k to indicated context and from indicated context to cross referencing 
are possible. These manipulations indicate some of the relationships 
between canons of various types. 
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